DataHub
Experience the full potential of your streaming and historical connected asset data with Cumulocity DataHub.
Reduce your overall data storage costs and work with the data lake of your choice. Integrate your own BI tooling, data models, and applications and efficiently query the data using open formats. Accelerate time-to-value by creating custom views on offloaded data to uncover valuable insights such as typical usage patterns and failure modes effortlessly
Learn from your IoT data
SQL queries on historical data
SQL-based Query Interface for querying the data lake enables you to connect any application that supports ODBC, JDBC, Apache Arrow Flight, or REST protocols.
Cost-efficient long-term storage
Create offloading pipelines into your preferred data lake. Save significantly on storage costs, up to 1000 times cheaper, enabling longer retention of data storage for AI/ML model training.
Cumulocity is a key component of our strategy to internalize and validate the data we’ve gathered over the last decade.
—Sébastien Trédan
Chief Technical and Data Officer
Greenflex
Incremental Extract-Transform-Store
Scheduled, regular data extraction from the Cumulocity operational storage, transformation into an optimized columnar format that is highly efficient for analytical queries, and incremental storage of your IoT data in the configured historical store.
Available from Cloud to Edge
Cumulocity DataHub Edge offers the same functionality as the cloud variant of Cumulocity DataHub, but stores the data locally using a NAS as a data lake.